Method for detecting hepatocellular carcinoma

ABSTRACT

The present invention provides an effective method for detecting hepatocellular carcinoma in a tested tissue, which comprises the step of measuring the expression level(s), in the tested tissue, of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and a method comprising the step of measuring, in the tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and, in the tested tissue, at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene.

BACKGROUND OF THE INVENTION

[0001] (1) Field of the Invention

[0002] The present invention relates to a method for detecting hepatocellular carcinoma in which expression levels of genes in a tested tissue collected from chronic hepatitis and other patients are measured.

[0003] (2) Description of the Related Art

[0004] It is believed that there are 500 million patients with viral hepatitis in the world. In South Asia in particular, 24.8% of individuals are infected with hepatitis B virus or hepatitis C virus, and 5% of these individuals suffer from chronic hepatitis. It is known that chronic hepatitis develops to hepatocellular carcinoma over a period of about 20 years.

[0005] Currently, abdominal echography, abdominal MRI, abdominal CT, angiography, biochemical test of tumor marker in blood serum, liver biopsy, etc., are known as methods of detecting and diagnosing hepatocellular carcinoma. However, there is no method that can effectively detect or determine hepatocellular carcinoma by measuring of the expressions of genes.

[0006] Carcinogenesis is one of the phenomena observed when a normal cell is affected by various outside factors and a change or alteration occurs in its genetic level, function of protein is affected by the change or alteration, and the normal cell functions are consequently destroyed. Many works have been reported of changes or alterations at the genetic level occurring in hepatocellular carcinoma tissues.

[0007] For example, it has been reported that the gene amplification of c-myc was in 33.3 to 36.4% of cases in hepatocellular carcinoma tissues (Oncology, 1999, 57, p. 157-163; Journal of Formos Medical Association, 1993, 92, p. 866-870).

[0008] It has been reported that point-mutation occurred in K-ras at a frequency of 0-16.7% (Anticancer Research, 1995, 15, p. 859-861; Oncogene, 1991, 6, p. 857-862).

[0009] It has also been reported that point-mutation occurred in p53 at a frequency of 23.1-50% (Cancer, 1994, 74, p. 30-37; Gastroenterlogy, 1999, 117, p. 154-160; Journal of Hepatology, 1993, 19, p. 312-315; British Journal of Cancer, 1999, 80, p. 59-66; Journal of Gastroenterological Hepatology, 1995, 10, p. 179-185).

[0010] Furthermore, in Rb and p53, loss of heterozygosity was observed at frequencies of 42.9-43.1% and 50-52.9%, respectively (Journal of Hepatology, 1993, 19, p. 312-315; British Journal of Cancer, 1999, 80, p. 59-66; Cancer Research, 1994, 54, p.4177-4182; European Jouranal of Cancer, 1999, 35, p. 1730-1734).

[0011] Underexpression of aldolase B has also been reported (Journal of Clinical laboratory analysis, 1994, 8, p. 144-148) and of albumin (Journal of Histochemistry and Cytochemistry, 1997, 45, p. 79-87).

[0012] It has also been reported that, in a rat liver carcinogenesis model, the expression level of the carbamyl phosphate synthase 1 gene decreases in proportion to the degree of malignancy of cancer (Scientia Sinica Series B, 1988, 31, p. 197-203).

[0013] However, these reports regarding the change or alteration of expression levels of genes are not an adequate basis to determine that precancerous conditions develop to hepatocellular carcinoma, and therefore an appropriate method for detecting hepatocellular carcinoma has not yet been established.

BRIEF SUMMERY OF THE INVENTION

[0014] A primary object of the present invention is to provide an effective method for detecting hepatocellular carcinoma. The invention also aims at providing an effective means for detecting hepatocellular carcinoma.

[0015] The present inventors performed a thorough comparison of expression levels of genes in the livers of chronic hepatitis patients between cancerous regions and noncancerous regions. As a result, they found that there are some genes whose expression levels significantly decrease in the cancerous region. They conducted further extensive research and completed the present invention. The invention relates to a method for detecting hepatocellular carcinoma, a method for early detection of hepatocellular carcinoma, and a DNA chip for detecting hepatocellular carcinoma as described below.

[0016] Item 1. A method for detecting hepatocellular carcinoma comprising the steps of:

[0017] (a) measuring, in a tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene; and

[0018] (b) comparing the expression levels of the genes measured in (a) with the expression levels of the genes in a control that correspond to the genes measured in step (a).

[0019] Item 2. A method for detecting hepatocellular carcinoma comprising the steps of:

[0020] (a) measuring, in a tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene; and

[0021] (b) comparing the expression level(s) of gene(s) measured in (a) with the expression levels of genes in a control that correspond to the genes measured in (a).

[0022] Item 3. A method for detecting hepatocellular carcinoma according to any one of Items 1 or 2, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by determining the amount of transcripts of the genes being measured.

[0023] Item 4. A method for detecting hepatocellular carcinoma according to any one of Items 1 or 2, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by amplifying whole or a part of the DNA to be measured and using cDNA prepared from gene transcripts as a template.

[0024] Item 5. A method for detecting hepatocellular carcinoma according to any one of Items 1 to 3, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by invader assay.

[0025] Item 6. A method for detecting hepatocellular carcinoma according to any one of Items 1 to 2, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by hybridizing labeled cDNA prepared from transcripts including the gene(s) to be measured with whole or a part of the immobilized DNA of the gene(s) to be measured.

[0026] Item 7. A method for detecting hepatocellular carcinoma according to any one of Items 1 to 6, wherein the tested tissue in the step (a) is liver tissue of a chronic hepatitis patient.

[0027] Item 8. A method for detecting hepatocellular carcinoma at an early stage that comprises the step of periodically measuring the expression level(s), in a tested tissue, of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene.

[0028] Item 9. A method for detecting hepatocellular carcinoma at an early stage that comprises the step of periodically measuring the expression level(s), in a tested tissue, of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene.

[0029] Item 10. A DNA chip for detecting hepatocellular carcinoma in which whole or a part of DNA comprising transcribed region(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene is immobilized.

[0030] Item 11. A DNA chip for detecting hepatocellular carcinoma in which whole or a part of DNA, in a tested tissue, comprising transcribed region(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and, at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene.

BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWING

[0031]FIG. 1 compares the expression levels of genes in hepatocellular carcinoma tissues and noncancerous tissues after conducting electrophoresis.

[0032] GTVA and GTVC indicate anchor primers, AP indicates an arbitrary primer. A to D represent tested patients, in which A and B are patients infected with the hepatitis B virus, and C and D are patients infected with the hepatitis C virus.

[0033] Lane N and lane T are electrophoresis patterns of samples prepared from noncancerous tissue and cancerous tissue respectively. M is a molecular marker.

[0034] Arrows point the genetic bands after conducting electrophoresis in which expression levels of genes are lower in hepatocellular carcinoma tissue than in the control.

DETAILED DESCRIPTION OF THE INVENTION

[0035] Hereunder, the present invention is explained in detail.

[0036] Representation of amino acids, peptides, base sequences, nucleotides, etc., by abbreviations in this specification is in conformity with the rules recommended by IUPAC-IUB, “Guideline for Preparation of a Specification or Equivalent Referring to a Base Sequence and/or an Amino Acid Sequence”(edited by the Japan Patent Office) and the conventions relating to the use of codes or symbols in the art.

[0037] (1) Genes Whose Degree of Expression is Lowered By Canceration

[0038] The present inventors performed a thorough comparison of expression levels of genes in the livers of chronic hepatitis patients between cancerous regions and noncancerous regions.

[0039] Specifically, a fluorescent-labeled cDNA library was synthesized from mRNA that had been prepared using a cancerous region and a noncancerous region of the liver of a chronic hepatitis patient, and then the library was subjected to separation by electrophoresis. The variance in the intensity of fluorescence between two tissues were examined, and genes significantly underexpressed in the cancerous region were selected as potential genes useful for detecting hepatocellular carcinoma (FIG. 1). The potential genes were then cloned to determined their base sequences. Furthermore, using a mRNA solution prepared from a cancerous region and a noncancerous region, the potential genes were quantified by a real-time RT-PCR method to confirm that they were actually underexpressed in the hepatocellular carcinoma.

[0040] Eight genes that were significantly underexpressed in hepatocellular carcinoma were selected.

[0041] The 8 genes were analyzed using the GenBank gene database, and it became clear that the 8 genes had the base sequences of Seq. Nos. 1 to 8.

[0042] The gene having the base sequence of Seq. No. 1 is a gene that codes aldolase B.

[0043] The gene having the base sequence of Seq. No. 2 is a gene that codes carbamyl phosphate synthase 1.

[0044] The gene having the base sequence of Seq. No. 3 is a gene that codes plasminogen.

[0045] The gene having the base sequence of Seq. No. 4 is EST51549 (GenBank Acc. No. AA345522) whose function is unknown.

[0046] The gene having the base sequence of Seq. No. 5 is a gene that codes albumin.

[0047] The gene having the base sequence of Seq. No. 6 is a gene that codes cytochrome P450 subfamily 2E1.

[0048] The gene having the base sequence of Seq. No. 7 is a gene that codes retinol-binding protein 4.

[0049] The gene having the base sequence of Seq. No. 8 is a gene that codes organic anion transporter C.

[0050] As shown in Table 1, the 8 genes were remarkably underexpressed in hepatocellular carcinoma. In particular, underexpression of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene in hepatocellular carcinoma is a new finding of the present invention.

[0051] (2) Detection Method

[0052] The detection method of the present invention is characterized in that it comprises the step of measuring the expression levels of the genes that exhibit underexpression in hepatocellular carcinoma in a tested tissue.

[0053] As described above, the 8 genes exhibit remarkably lowered expression levels in cancerous tissue developed from hepatocellular carcinoma. Therefore, by measuring the expression levels of these genes and comparing with a control, it becomes possible to detect hepatocellular carcinoma.

[0054] In the present invention, measurement of expression level(s) of, in particular, at least one member selected from the group of 4 genes consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene is conducted. The measurement may be conducted by measuring the expression level of one gene out of the 4 genes, more than one of the 4 genes, or all of the 4 genes.

[0055] In the present invention, it is preferable that, in addition to the above-mentioned 4 genes, the expression level(s) of at least one gene selected from the group consisting of the four further genes, i.e., aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene be measured.

[0056] In addition to measuring the expression level(s) of at least one gene selected from the four genes consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, by further measuring the expression level(s) of at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene, it becomes possible to conduct measurement or detection in a more accurate manner.

[0057] In particular, it is preferable to measure the expression levels of all 8 genes, i.e., plasminogen gene, EST51549, retinol-binding protein 4 gene, organic anion transporter C gene, aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene.

[0058] In the detection method of the present invention, as long as the measurement of the expression level(s) of at least one member selected from the group of 4 genes consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene is included, measurement of the expression levels of publicly known genes other than the above-mentioned 8 genes can be included.

[0059] The method for measuring the expression levels of genes is not particularly limited and conventional methods can be suitably used. For example, it is possible to employ a method in which the amount of transcript is determined or a method in which the amount of translated products is determined.

[0060] The method for determining the amount of transcript from a gene is such that mRNA is extracted from the tested tissue to determine the amount of the RNA products derived from the gene.

[0061] Extraction of RNA from the tested tissue and purification can be conducted by following conventional methods. Specifically, it can be conducted as follows: To the tested tissue, a solution containing phenol and guanidine thiocyanate is added. After dissolving or homogenizing, chloroform is added thereto, and the solution is then separated by centrifugation into an aqueous solution layer, which is the upper layer, and an organic layer, which is the lower layer. The RNA is dissolved in the aqueous layer, and therefore RNA can be recovered by collecting only the upper layer. By adding a lower alcohol, such as isopropanol, to the collected solution to precipitate RNA, after washing, RNA of high purity can be obtained. The extracted RNA can be used as total RNA or as purified mRNA.

[0062] Various methods can be employed to determine the amount of the extracted RNA. For example, RT-PCR, real-time RT-PCR, invader assay, DNA chip, Northern blot analysis, etc., can be employed.

[0063] The RT-PCR and real-time RT-PCR are methods in which complementary DNA (cDNA) is synthesized from mRNA, and DNA in the object region is synthesized using a suitable primer and DNA polymerase (generally, heat resistant DNA polymerase). Generally, the amount of RNA is measured after amplifying DNA by repeating denaturation, annealing and elongation of DNA.

[0064] The primers used in RT-PCR or real-time RT-PCR may use any base sequence region, as long as they comprise regions that can specifically amplify the object genes. The length of the base sequence is not particularly limited. Generally, base sequences having a length of from 20 to 30 nucleotides are used.

[0065] CYBR Green, PicoGreen, ethidium bromide, etc., are used as fluorescent molecules having affinity with DNA strands used in real-time RT-PCR. CYBR Green is preferably used.

[0066] RT-PCR and real-time RT-PCR are preferably employed because they can amplify DNA to several 100,000 times, are highly sensitive, and only a small amount of test sample is required.

[0067] The invader assay is a method comprising the steps of: on RNA or DNA, by linking a probe (invader probe) that is complementary to the RNA or DNA and a complementary probe (signal probe) that has a noncomplementary region at the 5′-end, cutting the signal probe by a Cleavase enzyme that recognizes the conformation; and measuring the expression levels of genes by detecting any fragments of the cut off signal probe.

[0068] As an invader probe, it is possible to use a probe that is homologous to the object transcript and there is no particular limitation to its base sequence length. There is no limitation to the length, etc. of the base sequence of the signal probe, as long as it has a base sequence that forms a triple strand structure with the invader probe on the transcript; specifically, a probe that has a noncomplementary region at the 5′-end and a complementary region at the 3′-end, and Cleavase can recognize the conformation and cut the signal probe off. The method for measuring or detecting the cut off fragment of the signal probe is such that, for example, when the fragment of the signal probe is a noncomplementary region of the transcript, the fragment with the fluorescent labeled probe is cut off by Cleavase and the fluorescent signal of the obtained fragment is measured. There is no limitation to the fluorescent labeled probe used in this case as long as it is a DNA probe having a base sequence complementary in a portion to the fragment of the signal probe, and, when the fragment is hybridized, a conformation recognizable by Cleavase as described above is formed, wherein a luminous material labels the cut off portion and a quenching material labels the uncut portion, and luminous signals are not emitted when it is in an uncut condition. As the luminous material, generally a fluorescent material, a phosphorescent material, etc., are preferably used. As the quenching substance, Cy3, etc., are preferably used.

[0069] When the signal probe fragment has a region that is complementary to the transcript, for example, it is possible to exemplify the method such that an immobilized oligonucleotide having a region complementary to the fragment is made to bind the separated fragment, and the fragment is detected by a fluorescence antibody method using fluorescein as a fluorescent pigment. This can be conducted using commercially available measurement kits, etc.

[0070] The invader assay is preferably employed because the probe itself does not have to be labeled and an amplification operation is unnecessary.

[0071] A DNA chip is explained in detail in the section entitled (5) DNA chip.

[0072] The method for determining the amount of translated products of gene is performed by quantifying the protein coded by the object gene. Specifically, it is possible to exemplify a method such that the amount of protein coded by the object gene is determined by employing an immunoassay using an antibody that can specifically recognize proteins. As the immunoassay, Western blot analysis, radioimmunoassay, ELISA, etc., can be exemplified.

[0073] In the detection method of the present invention, expression levels of genes in the tested tissue measured by the above-descried methods are compared to those of the corresponding genes in a control.

[0074] Specifically, (a), in the tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene is measured; and

[0075] (b) the expression level(s) of the genes measured in (a) is compared to the expression levels of the genes in the control that correspond to the gene(s) measured in (a).

[0076] Alternatively, (a), in the tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene and the expression level(s) of at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene, cytochrome P450 subfamily 2E1 gene are measured, and

[0077] (b) the expression levels of the genes measured in (a) are compared to the expression levels of the genes in the control that correspond to the genes measured in (a).

[0078] The tissue used as the control can be suitably selected depending on the means employed in the detection method or the purpose of detection. Specifically, tissue from nonpatients, noncancerous regions of hepatic tissue of chronic hepatitis patients, human peripheral blood mononuclear cells of nonpatients, etc., can be used.

[0079] When the expression levels of genes in the tested tissues are lower than the expression levels of corresponding genes in the control, it is assessed that the tested tissue is of hepatocellular carcinoma or the tested tissue includes hepatocellular carcinoma, and therefore hepatocellular carcinoma can be detected.

[0080] (3) Process for the Judgment

[0081] By measuring the expression levels of the above-mentioned 8 genes, it is possible to determine if a tested tissue has hepatocellular carcinoma and to assess the malignancy of the hepatocellular carcinoma, etc.

[0082] Specifically, it can be conducted by following the procedure as below:

[0083] First, in the tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene is measured.

[0084] Alternatively, in the tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene and the expression level(s) of at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene are measured.

[0085] The expression levels of the genes in the tested tissue are then compared to the expression levels of corresponding genes in the control.

[0086] When the expression levels of genes of the tested tissue are lower than those of the genes in the control, it is assumed that the tested tissue has hepatocellular carcinoma or there is a high possibility that hepatocellular carcinoma or like cancer cell is included in the tested tissue. It is also possible to assess the malignancy of the cancer based on the degree of underexpression.

[0087] This assessment method can be utilized in diagnosis or treatment of hepatitis patients.

[0088] (4) Early Detection Method

[0089] By periodically measuring the expression levels of the 8 genes, early detection of hepatocellular carcinoma or a region having a high possibility of developing into hepatocellular carcinoma becomes possible.

[0090] Specifically, it can be conducted by following the procedure as below:

[0091] First, in the tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene is periodically measured.

[0092] Alternatively, in the tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene and the expression level(s) of at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene are periodically measured.

[0093] As a result of the periodic measurement, when underexpression compared to a previously measured expression level is observed, it is possible to assess the tested tissue as having hepatocellular carcinoma or having a region highly possible to develop to hepatocellular carcinoma.

[0094] By periodically measuring the change in the expression levels of the specific genes, it is possible to detect occurrence or development of hepatocellular carcinoma at an early stage.

[0095] The measuring period, i.e., the duration of the period between measurements, can be suitably selected depending on the condition of the patient or tested individual. For example, periodic measurement can be performed once per a half-year or once per a year.

[0096] The early detection of the present invention can be used for preventing or treating hepatocellular carcinoma, or analysis for prognosis of a hepatitis patient.

[0097] (5) DNA Chip

[0098] The detection method, process for the judgment, and early detection method can effectively conducted by using a DNA chip in which whole or a part of DNA comprising transcribed region(s) of the gene(s) to be measured, i.e., at least one gene in the tested tissue selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, is immobilized. Furthermore, the method can be more effectively conducted by using a DNA chip in which whole or a part of DNA comprising transcribed regions of, in addition to the above-mentioned genes, at least one gene in the tested tissue selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene, is immobilized.

[0099] Among those types in which DNAs of a gene are immobilized on a surface, some are categorized as DNA arrays. The DNA arrays can be grouped into DNA microarrays and DNA macroarrays. The DNA chip of the present invention includes these so-called DNA arrays (including DNA microarrays and DNA macroarrays).

[0100] The DNA chip of the present invention can be produced by synthesizing whole or a part of the DNA comprising the transcribed regions of the genes to be measured by employing a conventional method, and immobilizing the DNA on a support or directly synthesizing it on a support).

[0101] There is no limitation to the support (or the surface) as long as it can immobilize DNA. For example, a silicon chip, a glass slide, a nylon membrane, etc., can be used.

[0102] There is no limitation to the immobilization method as long as it is a generally used method. Methods in which DNA is spotted using a spotter, an arrayer, etc., or in which synthesis of nucleotides is sequentially performed on a support, etc., are preferably employed.

[0103] There is no limitation to the region and the length of the base sequence of the DNA immobilized on the support as long as it specifically hybridizes with a labeled cDNA prepared from transcripts of the above-mentioned 8 genes.

[0104] For example, PCR products based on cDNA prepared from the transcripts of the 8 genes, synthesized oligonucleotides or their partial fragments prepared in accordance with the base sequences of the transcribed regions of the 8 genes, etc., can be preferably used.

[0105] The DNA chip of the invention is, specifically, produced by immobilizing whole or a part of the DNA comprising a transcribed region of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene.

[0106] The DNA chip of the invention can also be produced by immobilizing whole or a part of the DNA, in the tested tissue comprising transcribed regions of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and, at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene.

[0107] In the DNA chip of the present invention, it is possible to further immobilize, if desired, whole or a part of DNA comprising transcribed region(s) of known genes other than the above 8 genes, synthesized nucleotides or their fragments, etc.

[0108] The DNA chip of the present invention can be used for detecting. hepatocellular carcinoma. To be more specific, it can be used for diagnosing hepatocellular carcinoma or detecting hepatocellular carcinoma at an early stage.

[0109] Specifically, the DNA chip of the invention can be used in the following manner:

[0110] From the tested tissue, the transcripts of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene are extracted. By hybridizing the labeled cDNA prepared from the transcripts of the genes on the DNA chip of the invention, the expression levels of the genes can be measured. The measurement for detecting hepatocellular carcinoma according to the invention can be thereby performed.

[0111] cDNAs prepared from the transcripts of the tested tissue and the control are each labeled by different colorants, such as Fluorescein (green), Phycoerythrin (red), a substance having biotin added to Fluorescein or Phycoerythrin, Cy3-deoxyuridine triphosphate, dUTP, and Cy5-dUTP. The cDNAs are linked to the DNA chip and the difference in the intensity of fluorescence is processed by computer. By numerically expressing the degree of underexpression of the targeted genes in the tested tissue compared to those in the control, the method for detecting hepatoma cells of the present invention can be conducted. During this step, it is also possible to detect hepatoma cells by visually identifying the differences in color using a fluorescence microscope.

[0112] The operations employed in the detection method, process for the judgment, early detection method of the invention, or production of the DNA chip, for example, chemical synthesis, cutting, removing, linking or adding of DNA, and isolation, purification, amplification or reproduction of enzymes used for synthesizing cDNA of gene transcripts or transcripts of genes, etc., can be conducted by methods that were known before the filing date of the present application. Determination or confirmation of base sequences can be performed by, for example, the dideoxy method or Maxam-Gilbert method.

EXAMPLES

[0113] The present invention is explained below in further detail with reference to Examples. However, the scope of the invention is not limited to these Examples.

[0114] 1. Determination of a gene that is underexpressed in hepatocellular carcinoma.

[0115] Cells in a hepatocellular carcinom lesion of the liver tissue of chronic viral hepatitis patients (two patients infected with HBV and two patients with HCV) and cells in a noncancerous region of the same liver tissue were used as samples. Total RNAs were extracted from the cancerous tissue and noncancerous tissue surgically resected from hepatitis patients. To 1 μg of total RNAs, ROX-fluorescent-labeled 3′-anchored oligo-dT (oligo-dT; GT15MG, GT15MA, GT15MT, GT15MC, where M represents a mixture of G, A and C, synthesized by Greiner Labortechnik Japan/Japan, 50 pmol in 11 μl of diethylpyrocarbonate-treated water) was added and heated at 70° C. for 10 minutes. Solution A having the following composition was then added thereto to obtain a final volume of 20 μl.

[0116] (Composition of Solution A)

[0117] 4 μl of 5 × first-strand buffer (0.25 M tris-HCl, pH7.5; 0.375 mol/L KCl; 0.05 mol/L dithiothreitol and 0.015 mol/L MgCl₂) 2 μl of 0.1 mol/L dithiothreitol, 1 μl of 2.5 mmol/l deoxynucleotide triphosphates (dNTPs), 1 μl of ribonuclease inhibitor (40 units; Wako Pure Chemical Industries, Japan) and 1 μl of superscript II reverse transcriptase (200 units; BRL, USA).

[0118] The RNA solution was incubated at 42° C. for one hour to synthesize cDNA and then diluted 5-fold by the addition of 80 μl of diethylpyrocarbonate-treated water. Using the resulting cDNA as a temlate, amplification of the object genes was conducted by PCR. The added reagents and the reaction conditions were as follows:

[0119] Added reagents: 2 μl of reaction solution, 2 μl of 10×PCR buffer (100 mmol/L Tris-HCl, 15 mmol/L MgCl₂, 500 mmol/L KCl and 1 mg/ml geratin, pH8.5), 1.6 μl of 2.5 mmol/L dNTPs, 0.2 μl of Taq DNA polymerase (5 units/μl; Roche Molecular Systems, NJ), 5 pmol of ROX-fluorescent-labeled 3′-anchored oligo-dT primer and 10 pmol of ROX-fluorescent-labeled 5′-anchored oligo-dT primer.

[0120] Reaction conditions: one cycle of 3 minutes at 95° C., 5 minutes at 40° C., and 5 minutes at 72 C.; then 2 to 40 cycles of 30 seconds at 95° C., 2 minutes at 40° C., and 5 minutes at 72° C.

[0121] Each reaction solution prepared from the cancerous tissue and noncancerous tissue as described above was electrophoresed on 6% polyacrylamide gel containing 7.5 M urea. Using an FM BIO II imaging analyzer (Takara Holdings Inc.), the expression levels of genes were analyzed. As a result, it was found that there was a difference in intensity of fluorescence between the genes in noncancerous tissue and cancerous tissue and a plurality of genes (shown by arrows) apparently underexpressed in the cancerous tissue (FIG. 1). The bands showing differences in intensity of fluorescence were cut and the fragments were immersed in a 100 μl of TE (Tris-HCl, EDTA) buffer for one hour and DNAs were extracted. Thereafter, reamplification was conducted by PCR using the extract solution as a template. The reaction conditions were the same as those in the first PCR.

[0122] The reamplified PCR products were electrophoresed on 3% agarose gel, the bands thereof were cut, and recovered using GFX PCR DNA and Gel Band Purification Kit (Amersham Pharmacia Biotech, NJ). The DNAs of the reamplified PCR products were cloned using the cloning vector pCRII (Invitrogen Japan, Japan), and strands of DNAs were sequenced using an ABI377 (Applied Biosystems, USA).

[0123] As the result of the above operation, 8 genes that show significantly decreased expression levels were identified.

[0124] The nucleotide sequences of the 8 genes were analyzed using the GenBank database and it was determined that these 8 genes had the nucleotide sequences of Seq. Nos. 1 to 8. In other words, they are 8 genes, namely aldolase B gene, carbamyl phosphate synthase 1 gene, plasminogen gene, EST51549, albumin gene, cytochrome P450 subfamily 2E1 gene, retinol-binding protein 4 gene and organic anion transporter C gene.

[0125] 2. Measurement of expression levels of genes in a cancerous cell underexpressed in hepatocellular carcinoma Real-time RT-PCR was conducted based on total RNAs prepared by extracting from chronic hepatitis patients derived hepatocellular carcinoma tissue (samples surgically obtained from 20 patients each with chronic hepatitis).

[0126] Specifically, amplification reaction was performed using a 20 μl of total RNA solution containing 2 μl of 10× reaction buffer (Taq polymerase, dNTP, MgCl₂ and CYBR Green fluorescent (Roche Diagnostics) and 2 μl of template cDNA with each oligonucleotide primer.

[0127] Reaction conditions: 40 cycles of 10 seconds at 95° C., 10 seconds at 65° C. and 30 seconds at 72° C. As a PCR amplifier, Light Cycler (Roche Diagnostics, Germany) was used.

[0128] The measured expressions were compared to those of the noncancerous tissue of the same chronic hepatitis patient. Table 1 shows the results. TABLE 1 Hepatocellular carcinoma patients with underexpression over 50% GenBank HBV (−), HCV (−) HBV (+), HCV (−) HBV (−), HCV (+) total Genes ACC. No. (n = 2) (n = 3) (n = 15) (n = 20) Aldolase B gene X02747 2/2 (100%) 2/3 (66.7%) 14/15 (93.3%) 18/20 (90.0%) Carbamyl phosphate synthase D90282 2/2 (100%) 2/3 (66.7%) 11/15 (73.3%) 15/20 (75.0%) I gene Phasminogen gene X05199 2/2 (100%) 2/3 (66.7%) 11/15 (73.3%) 15/20 (75.0%) EST51549 AA345522 2/2 (100%) 2/3 (66.7%) 11/15 (73.3%) 15/20 (75.0%) Albumin gene V00495 2/2 (100%) 1/3 (33.3%) 12/15 (80.0%) 15/20 (75.0%) Cytochrome P450 subfamily J02843 2/2 (100%) 1/3 (33.3%) 10/15 (66.7%) 13/20 (65.0%) 2E1 gene Retinol-binding protein 4 gene X00129 2/2 (100%) 1/3 (33.3%)  9/15 (60.0%) 12/20 (60.0%) Organic anion transporter AB026257 2/2 (100%) 1/3 (33.3%)  8/15 (53.3%) 11/20 (55.0%) C gene

[0129] In table 1, HBV indicates hepatitis B virus and HCV indicates hepatitis C virus. (+) indicates that the patient was infected by the virus and (−) indicates that the patient was not infected by the virus.

[0130] In the fractions of table 1, the numerators express the number of patients with underexpression over 50% and the denominators express the total number of tested patients. The percentage numbers in ( ) indicate the percentage ratio of the number of patients with underexpression over 50% to the total number of the tested patients.

[0131] As shown in table 1, regardless of the type of chronic hepatitis, patients with hepatocellular carcinoma showed a significant decrease, i.e., over 50%, in the expression levels of the 8 genes.

[0132] As described above, aldolase B gene, carbamyl phosphate synthase 1 gene, plasminogen gene, EST51549, albumin gene, cytochrome P450 subfamily 2E1 gene, retinol-binding protein 4 gene and organic anion transporter C gene are underexpressed in hepatocellular carcinoma tissue.

[0133] By measuring the expressions of these genes and determining if the expressions thereof are decreased compared to those of a control, it becomes possible to accurately detect hepatocellular carcinoma. Furthermore, by measuring the expressions of the genes or degree of underexpression thereof, if decreased, diagnosis of hepatocellular carcinoma or early detection thereof can be properly performed. The DNA chip obtained by immobilizing whole or a part of the DNA comprising the transcribed regions of the 8 genes can be used as an effective tool for detecting hepatocellular carcinoma.

[0134] As described above, the technique of the present invention can be effectively used for prevention, diagnosis, or treatment of hepatocellular carcinoma, and analysis for prognosis of chronic hepatitis, etc.

1 8 1 1652 DNA human 1 aaaaacatga tgagaagtct ataaaaattg tgtgctacca aagatctgtc ttatttggca 60 gctgctgcct cacccacagc ttttgatatc taggaggact cttctctccc aaactacctg 120 tcaccatggc ccaccgattt ccagccctca cccaggagca gaagaaggag ctctcagaaa 180 ttgcccagag cattgttgcc aatggaaagg ggatcctggc tgcagatgaa tctgtaggta 240 ccatggggaa ccgcctgcag aggatcaagg tggaaaacac tgaagagaac cgccggcagt 300 tccgagaaat cctcttctct gtggacagtt ccatcaacca gagcatcggg ggtgtgatcc 360 ttttccacga gaccctctac cagaaggaca gccagggaaa gctgttcaga aacatcctca 420 aggaaaaggg gatcgtggtg ggaatcaagt tagaccaagg aggtgctcct cttgcaggaa 480 caaacaaaga aaccaccatt caagggcttg atggcctctc agagcgctgt gctcagtaca 540 agaaagatgg tgttgacttt gggaagtggc gtgctgtgct gaggattgcc gaccagtgtc 600 catccagcct cgctatccag gaaaacgcca acgccctggc tcgctacgcc agcatctgtc 660 agcagaatgg actggtacct attgttgaac cagaggtaat tcctgatgga gaccatgacc 720 tggaacactg ccagtatgtt actgagaagg tcctggctgc tgtctacaag gccctgaatg 780 accatcatgt ttacctggag ggcaccctgc taaagcccaa catggtgact gctggacatg 840 cctgcaccaa gaagtatact ccagaacaag tagctatggc caccgtaaca gctctccacc 900 gtactgttcc tgcagctgtt cctggcatct gctttttgtc tggtggcatg agtgaagagg 960 atgccactct caacctcaat gctatcaacc tttgccctct accaaagccc tggaaactaa 1020 gtttctctta tggacgggcc ctgcaggcca gtgcactggc tgcctggggt ggcaaggctg 1080 caaacaagga ggcaacccag gaggctttta tgaagcgggc catggctaac tgccaggcgg 1140 ccaaaggaca gtatgttcac acgggttctt ctggggctgc ttccacccag tcgctcttca 1200 cagcctgcta tacctactag ggtccaatgc ccgccagcct agctccagtg cttctagtag 1260 gagggctgaa agggagcaac ttttcctcta atcctggaaa ttcgacacaa ttagatttga 1320 actgctggaa atacaacaca tgttaaatct taagtacaag ggggaaaaaa taaatcagtt 1380 attgaaacat aaaaatgaat accaaggacc tgatcaaatt tcacacagca gtttccttgc 1440 aacactttca gctccccatg ctccagaata cccacccaag aaaataatag gctttaaaac 1500 aatatcggct cctcatccaa agaacaactg ctgattgaaa cacctcatta gctgagtgta 1560 gagaagtgca tcttatgaaa cagtcttagc agtggtaggt tgggaaggag atagctgcaa 1620 ccaaaaaaga aataaatatt ctataaacct tc 1652 2 5215 DNA human 2 aagcaacctt aaaatgactg caccctccca gatttctttt acattaacta aaaagtctta 60 tcacacaatc tcataaaatt tatgtaattt catttaattt tagccacaaa tcatcaaaat 120 gacgaggatt ttgacagctt tcaaagtggt gaggacactg aagactggtt ttggctttac 180 caatgtgact gcacaccaaa aatggaaatt ttcaagacct ggcatcaggc tcctttctgt 240 caaggcacag acagcacaca ttgtcctgga agatggaact aagatgaaag gttactcctt 300 tggccatcca tcctctgttg ctggtgaagt ggtttttaat actggcctgg gagggtaccc 360 agaagctatt actgaccctg cctacaaagg acagattctc acaatggcca accctattat 420 tgggaatggt ggagctcctg atactacttc tctggatgaa ctgggactta gcaaatattt 480 ggagtctaat ggaatcaagg tttcaggttt gctggtgctg gattatagta aagactacaa 540 ccactggctg gctaccaaga gtttagggca atggctacag gaagaaaagg ttcctgcaat 600 ttatggagtg gacacaagaa tgctgactaa aataattcgg gataagggta ccatgcttgg 660 gaagattgaa tttgaaggtc agcctgtgga ttttgtggat ccaaataaac agaatttgat 720 tgctgaggtt tcaaccaagg atgtcaaagt gtacggcaaa ggaaacccca caaaagtggt 780 agctgtagac tgtgggatta aaaacaatgt aatccgcctg ctagtaaagc gaggagctga 840 agtgcactta gttccctgga accatgattt caccaagatg gagtatgatg ggattttgat 900 cgcgggagga ccggggaacc cagctcttgc agaaccacta attcagaatg ttcagaagat 960 tttggagagt gatcgcaagg agccattgtt tggaatcagt acaggaaact taataacagg 1020 attggctgct ggtgccaaaa cctacaagat gtccatggcc aacagagggc agaatcagcc 1080 tgttttgaat atcacaaaca aacaggcttt cattactgct cagaatcatt gctatgcctt 1140 ggacaacacc ctccctgctg gctggaaacc actttttgtg aatgtcaacg atcaaacaaa 1200 tgaggggatt atgcatgaga gcaaaccctt cttcgctgtg cagttccacc cagaggtcac 1260 cccggggcca atagacactg agtacctgtt tgattccttt ttctcactga taaagaaagg 1320 aaaagctacc accattacat cagtcttacc gaagccagca ctagttgcat ctcgggttga 1380 ggtttccaaa gtccttattc taggatcagg aggtctgtcc attggtcagg ctggagaatt 1440 tgattactca ggatctcaag ctgtaaaagc catgaaggaa gaaaatgtca aaactgttct 1500 gatgaaccca aacattgcat cagtccagac caatgaggtg ggcttaaagc aagcggatac 1560 tgtctacttt cttcccatca cccctcagtt tgtcacagag gtcatcaagg cagaacagcc 1620 agatgggtta attctgggca tgggtggcca gacagctctg aactgtggag tagaactatt 1680 caagagaggt gtgctcaagg aatatggtgt gaaagtcctg ggaacttcag ttgagtccat 1740 tatggctacg gaagacaggc agctgttttc agataaacta aatgagatca atgaaaagat 1800 tgctccaagt tttgcagtgg aatcgattga ggatgcactg aaggcagcag acaccattgg 1860 ctacccagtg atgatccgtt ccgcctatgc actgggtggg ttaggctcag gcatctgtcc 1920 caacagagag actttgatgg acctcagcac aaaggccttt gctatgacca accaaattct 1980 ggtggagaag tcagtgacag gttggaaaga aatagaatat gaagtggttc gagatgctga 2040 tgacaattgt gtcactgtct gtaacatgga aaatgttgat gccatgggtg ttcacacagg 2100 tgactcagtt gttgtggctc ctgcccagac actctccaat gccgagtttc agatgttgag 2160 acgtacttca atcaatgttg ttcgccactt gggcattgtg ggtgaatgca acattcagtt 2220 tgcccttcat cctacctcaa tggaatactg catcattgaa gtgaatgcca agatgtcccc 2280 gaactctgct ctggcctcca aaacgactgg ctacccattg gcattcattg ctgcaaagat 2340 tgccctagga atcccacttc caggaattaa gaacgtcgta tccgggaaga catcagcctg 2400 ttttgaacct agcctggatt acatggtcac caagattccc cgctgggatc ttgaccgttt 2460 tcatggaaca tctagccgaa ttggtagctc tatgaaaagt gtaggagagg tcatggctat 2520 tggtcgtacc tttgaggaga gtttccagaa agctttacgg atgtgccacc catctataga 2580 gggtttcact ccccgtctcc caatgaacaa agaatggcca tcgaatttag atcttagaaa 2640 agagttgtct gaaccaagca gcacgcgtat ctatgccatt gccaaggcca ttgatgacaa 2700 catgtccctt gatgagattg agaagctcac atacattgac aagtggtttt tgtataagat 2760 gcgtgatatt ttaaacatgg aaaagacact gaaaggcctc aacagtgagt ccatgacaga 2820 agaaaccctg aaaagggcaa aggagattgg gttctcagat aagcagattt caaaatgcct 2880 tgggctcact gaggcccaga caagggagct gaggttaaag aaaaacatcc acccttgggt 2940 taaacagatt gatacactgg ctgcagaata cccatcagta acaaactatc tctatgttac 3000 ctacaatggt caggagcatg atgtcaattt tgatgaccat ggaatgatgg tgctaggctg 3060 tggtccatat cacattggca gcagtgtgga atttgattgg tgtgctgtct ctagtatccg 3120 cacactgcgt caacttggca agaagacggt ggtggtgaat tgcaatcctg agactgtgag 3180 cacagacttt gatgagtgtg acaaactgta ctttgaagag ttgtccttgg agagaatcct 3240 agacatctac catcaggagg catgtggtgg ctgcatcata tcagttggag gccagattcc 3300 aaacaacctg gcagttcctc tatacaagaa tggtgtcaag atcatgggca caagccccct 3360 gcagatcgac agggctgagg atcgctccat cttctcagct gtcttggatg agctgaaggt 3420 ggctcaggca ccttggaaag ctgttaatac tttgaatgaa gcactggaat ttgcaaagtc 3480 tgtggactac ccctgcttgt tgaggccttc ctatgttttg agtgggtctg ctatgaatgt 3540 ggtattctct gaggatgaga tgaaaaaatt cctagaagag gcgactagag tttctcaggc 3600 cacgccagtg gtgctgacaa aatttgttga aggggcccga gaagtagaaa tggacgctgt 3660 tggcaaagat ggaagggtta tctctcatgc catctctgaa catgttgaag atgcaggtgt 3720 ccactcggag aatgccactc tgatgctgcc cacacaaacc atcagccaag gggccattga 3780 aaaggtgaag gatgctaccc ggaagattgc aaaggctttt gccatctctg gtccattcaa 3840 cgtccaattt cttgtcaaag gaaatgatgt cttggtgaat gagtgtaact tgagagcttc 3900 tcgatccttc ccctctgttt ccaagactct tggggttgac ttcattgatg tggccaccaa 3960 ggtgttgatt ggagagaatg ttgatgagaa acatcttcca acattggacc atcccataat 4020 tcctgttgac tatgttgcaa ttaaggctcc catgttttcc tggccccggt tgagggatgc 4080 tgaccccatt ctgagatgtg agatggcttc cactggagag gtggcttgct ttggtgaagg 4140 tattcataca gccttcctaa aggcaatgct ttccacagga tttaagatac cccagaaagg 4200 catcctgata ggcatccagc aatcattccg gccaagattc cttggtgtgg ctgaacaatt 4260 acacaatgaa ggtttcaagc tgtttgccac ggaagccaca tcagactggc tcaacgccaa 4320 caatgtccct gccaacccag tggcatggcc gtctcaagaa ggacagaatc ccagcctctc 4380 ttccatcaga aaattgatta gagatggcag cattgaccta gtgattaacc ttcccaacaa 4440 caacactaaa tttgtccatg ataattatgt gattcggagg acagctgttg atagtggaat 4500 ccctctcctc actaattttc aggtgaccaa actttttgct gaagctgtgc agaaatctcg 4560 caaggtggac tccaagagtc ttttccacta caggcagtac agtgctggaa aagcagcata 4620 gagatgcaga caccccagcc ccattattaa atcaacctga gccacatgtt atataaagga 4680 actgattcac aactttctca gagatgaata ttgataacta aacttcattt cagtttactt 4740 tgttatgcct taatattctg tgtcttttgc aattaaattg tcagtcactt cttcaaaacc 4800 ttacagtcct tcctaaggtt actcttcatg agattcatcc atttactaat actgtatttt 4860 tggtggacta ggcttgccta tgtgcttatg tgtagctttt tactttttat ggtgtgatta 4920 atggtgatca aggtaggaaa agttgtgttc tattttcttg aactccttct atactttaag 4980 atactctatt tttaaaacac tatctgcaaa ctcaggacac tttaacaggg cagaatactc 5040 taaaaacttg ataaaattaa atatagattt aatttatgaa ccttccatca tgtgtttgtg 5100 tattgcttct ttttggatcc tcattctcac ccatttggct aatccaggaa tattgttatc 5160 ccttcccatt atattgaagt tgagaaatgt gacagagcat ttagagtatg aattc 5215 3 2732 DNA human 3 aacaacatcc tgggattggg acccactttc tgggcactgc tggccagtcc caaaatggaa 60 cataaggaag tggttcttct acttctttta tttctgaaat caggtcaagg agagcctctg 120 gatgactatg tgaataccca gggggcttca ctgttcagtg tcactaagaa gcagctggga 180 gcaggaagta tagaagaatg tgcagcaaaa tgtgaggagg acgaagaatt cacctgcagg 240 gcattccaat atcacagtaa agagcaacaa tgtgtgataa tggctgaaaa caggaagtcc 300 tccataatca ttaggatgag agatgtagtt ttatttgaaa agaaagtgta tctctcagag 360 tgcaagactg ggaatggaaa gaactacaga gggacgatgt ccaaaacaaa aaatggcatc 420 acctgtcaaa aatggagttc cacttctccc cacagaccta gattctcacc tgctacacac 480 ccctcagagg gactggagga gaactactgc aggaatccag acaacgatcc gcaggggccc 540 tggtgctata ctactgatcc agaaaagaga tatgactact gcgacattct tgagtgtgaa 600 gaggaatgta tgcattgcag tggagaaaac tatgacggca aaatttccaa gaccatgtct 660 ggactggaat gccaggcctg ggactctcag agcccacacg ctcatggata cattccttcc 720 aaatttccaa acaagaacct gaagaagaat tactgtcgta accccgatag ggagctgcgg 780 ccttggtgtt tcaccaccga ccccaacaag cgctgggaac tttgcgacat cccccgctgc 840 acaacacctc caccatcttc tggtcccacc taccagtgtc tgaagggaac aggtgaaaac 900 tatcgcggga atgtggctgt taccgtttcc gggcacacct gtcagcactg gagtgcacag 960 acccctcaca cacataacag gacaccagaa aacttcccct gcaaaaattt ggatgaaaac 1020 tactgccgca atcctgacgg aaaaagggcc ccatggtgcc atacaaccaa cagccaagtg 1080 cggtgggagt actgtaagat accgtcctgt gactcctccc cagtatccac ggaacaattg 1140 gctcccacag caccacctga gctaacccct gtggtccagg actgctacca tggtgatgga 1200 cagagctacc gaggcacatc ctccaccacc accacaggaa agaagtgtca gtcttggtca 1260 tctatgacac cacaccggca ccagaagacc ccagaaaact acccaaatgc tggcctgaca 1320 atgaactact gcaggaatcc agatgccgat aaaggcccct ggtgttttac cacagacccc 1380 agcgtcaggt gggagtactg caacctgaaa aaatgctcag gaacagaagc gagtgttgta 1440 gcacctccgc ctgttgtcct gcttccagat gtagagactc cttccgaaga agactgtatg 1500 tttgggaatg ggaaaggata ccgaggcaag agggcgacca ctgttactgg gacgccatgc 1560 caggactggg ctgcccagga gccccataga cacagcattt tcactccaga gacaaatcca 1620 cgggcgggtc tggaaaaaaa ttactgccgt aaccctgatg gtgatgtagg tggtccctgg 1680 tgctacacga caaatccaag aaaactttac gactactgtg atgtccctca gtgtgcggcc 1740 ccttcatttg attgtgggaa gcctcaagtg gagccgaaga aatgtcctgg aagggttgtg 1800 ggggggtgtg tggcccaccc acattcctgg ccctggcaag tcagtcttag aacaaggttt 1860 ggaatgcact tctgtggagg caccttgata tccccagagt gggtgttgac tgctgcccac 1920 tgcttggaga agtccccaag gccttcatcc tacaaggtca tcctgggtgc acaccaagaa 1980 gtgaatctcg aaccgcatgt tcaggaaata gaagtgtcta ggctgttctt ggagcccaca 2040 cgaaaagata ttgccttgct aaagctaagc agtcctgccg tcatcactga caaagtaatc 2100 ccagcttgtc tgccatcccc aaattatgtg gtcgctgacc ggaccgaatg tttcatcact 2160 ggctggggag aaacccaagg tacttttgga gctggccttc tcaaggaagc ccagctccct 2220 gtgattgaga ataaagtgtg caatcgctat gagtttctga atggaagagt ccaatccacc 2280 gaactctgtg ctgggcattt ggccggaggc actgacagtt gccagggtga cagtggaggt 2340 cctctggttt gcttcgagaa ggacaaatac attttacaag gagtcacttc ttggggtctt 2400 ggctgtgcac gccccaataa gcctggtgtc tatgttcgtg tttcaaggtt tgttacttgg 2460 attgagggag tgatgagaaa taattaattg gacgggagac agagtgacgc actgactcac 2520 ctagaggctg ggacgtgggt agggatttag catgctggaa ataactggca gtaatcaaac 2580 gaagacactg tccccagcta ccagctacgc caaacctcgg cattttttgt gttattttct 2640 gactgctgga ttctgtagta aggtgacata gctatgacat ttgttaaaaa taaactctgt 2700 acttaacttt gatttgagta aattttggtt tt 2732 4 288 DNA human misc_feature (17)..(17) “n” may be any nucleotide 4 cttatctaaa agagganctn caggtctcaa ccntgccagt cacaccnaat taatgtcctt 60 cacaaaaata ancagcatat gttccctttc aatttgagtt cagtgagctc acagcaaaat 120 ttacctttta attttnttca gcaaatccaa gacgaatata caaaggatga gattagataa 180 agatttcagt ttccngtatg ccaccgntgc cgccaatttt ccaaaaaagc ctggctcctc 240 ttttcctgtt cctccatcca agcccccaaa gatctctaac cagaatta 288 5 2251 DNA human 5 aggatgtctt ctggcaattt catataagta ttttttcaaa aatgtctctt ctgtcaaccc 60 cacgcctttg gcacaatgaa gtgggtaacc tttatttccc ttctttttct ctttagctcg 120 gcttattcca ggggtgtgtt tcgtcgagat gcacacaaga gtgaggttgc tcatcggttt 180 aaagatttgg gagaagaaaa tttcaaagcc ttggtgttga ttgcctttgc tcagtatctt 240 cagcagtgtc catttgaaga tcatgtaaaa ttagtgaatg aagtaactga atttgcaaaa 300 acatgtgtag ctgatgagtc agctgaaaat tgtgacaaat cacttcatac cctttttgga 360 gacaaattat gcacagttgc aactcttcgt gaaacctatg gtgaaatggc tgactgctgt 420 gcaaaacaag aacctgagag aaatgaatgc ttcttgcaac acaaagatga caacccaaac 480 ctcccccgat tggtgagacc agaggttgat gtgatgtgca ctgcttttca tgacaatgaa 540 gagacatttt tgaaaaaata cttatatgaa attgccagaa gacatcctta cttttatgcc 600 ccggaactcc ttttctttgc taaaaggtat aaagctgctt ttacagaatg ttgccaagct 660 gctgataaag ctgcctgcct gttgccaaag ctcgatgaac ttcgggatga agggaaggct 720 tcgtctgcca aacagagact caaatgtgcc agtctccaaa aatttggaga aagagctttc 780 aaagcatggg cagtggctcg cctgagccag agatttccca aagctgagtt tgcagaagtt 840 tccaagttag tgacagatct taccaaagtc cacacggaat gctgccatgg agatctgctt 900 gaatgtgctg atgacagggc ggaccttgcc aagtatatct gtgaaaatca ggattcgatc 960 tccagtaaac tgaaggaatg ctgtgaaaaa cctctgttgg aaaaatccca ctgcattgcc 1020 gaagtggaaa atgatgagat gcctgctgac ttgccttcat tagctgctga ttttgttgaa 1080 agtaaggatg tttgcaaaaa ctatgctgag gcaaaggatg tcttcctggg catgtttttg 1140 tatgaatatg caagaaggca tcctgattac tctgtcgtgc tgctgctgag acttgccaag 1200 acatatgaaa ccactctaga gaagtgctgt gccgctgcag atcctcatga atgctatgcc 1260 aaagtgttcg atgaatttaa acctcttgtg gaagagcctc agaatttaat caaacaaaac 1320 tgtgagcttt ttaagcagct tggagagtac aaattccaga atgcgctatt agttcgttac 1380 accaagaaag taccccaagt gtcaactcca actcttgtag aggtctcaag aaacctagga 1440 aaagtgggca gcaaatgttg taaacatcct gaagcaaaaa gaatgccctg tgcagaagac 1500 tatctatccg tggtcctgaa ccagttatgt gtgttgcatg agaaaacgcc agtaagtgac 1560 agagtcacaa aatgctgcac agagtccttg gtgaacaggc gaccatgctt ttcagctctg 1620 gaagtcgatg aaacatacgt tcccaaagag tttaatgctg aaacattcac cttccatgca 1680 gatatatgca cactttctga gaaggagaga caaatcaaga aacaaactgc acttgttgag 1740 cttgtgaaac acaagcccaa ggcaacaaaa gagcaactga aagctgttat ggatgatttc 1800 gcagcttttg tagagaagtg ctgcaaggct gacgataagg agacctgctt tgccgaggag 1860 ggtaaaaaac ttgttgctgc aagtcaagct gccttaggct tataacatct acatttaaaa 1920 gcatctcagc ctaccatgag aataagagaa agaaaatgaa gatcaaaagc ttattcatct 1980 gttttctttt tcgttggtgt aaagccaaca ccctgtctaa aaaacataaa tttctttaat 2040 cattttgcct cttttctctg tgcttcaatt aataaaaaat ggaaagaatc taatagagtg 2100 gtacagcact gttatttttc aaagatgtgt tgctatcctg aaaattctgt aggttctgtg 2160 gaagttccag tgttctctct tattccactt cggtagagga tttctagttt ctgtgggcta 2220 attaaataaa tcactaatac tcttctaagt t 2251 6 14776 DNA human 6 cccccattga aaaattgtct ttctgatctt tataaacaat tatttaatat ccagtaaaat 60 cttctctata ttgctttact agtgagttct attaaaattt tgaagcacag aaaattcccc 120 tacagtataa agtatcccca gtcacagaga agacaggggt tttgcaatga tttctagaat 180 agtgcaattt ttatgcaaga acctaatata acacaaaaat tatagcccga ttttatttgt 240 gggtatagat gcaaaattac taaaaatact attaacaagt tgaatcctta gggtgttaaa 300 agagtatcac tccatgaacg agttggttgt gatgtggaac tatgaggtac ttttatgata 360 caatataaaa atttatggta attttatggt acattgtgag acagtgtttt cttctagcat 420 catactagca ggtctatgga gaaaaatcac aggattgtct caatcaaaaa aagatttcat 480 taacccaact ctcatccctg ataaacactg ttagttatct agagaaagaa gaaaattgtc 540 ccaatacagt cacctctttg ccacacccag ccaacagcag acgtgatgga agcctgaaga 600 acaccctgcc acgggcacag gcagaggcac aggcaccctg tcgtcctgat tatttcacct 660 tgtcacgggc agaggcacag gcaccctgtc gtcctgatta tttcaccttg tcacaggcac 720 aggcaccctg tcgtcctgat tatttcacct tgtcacaggc acaggcactc tgtcgtcctg 780 attatttcac cttgtcacgg gcagaggcac aggcactctg tcatcctgat tatttcacct 840 tgtcctagag tgtcctgcca atgggacaga tgcaaaacaa ataaaagccc cggcttctga 900 aaagaagcac acagaaatgt cattattttc aaacgaggtg ttcccgtata taaaatttga 960 tgttggttgg gcatctaaca gtattatggc cagaggactc agaccacagc tgcatccctg 1020 tgaggcacag actctccagg gcacgcgggt cccgctggga tgtgcacact caggtgagct 1080 gcacagacaa ggtgtcctca gcccagggga gccagaggcc tgctctgcct ctccaccctg 1140 atgcttcctg ttctcacccc accaaagcca aggcttcaat ttcagtctgt ggggagctga 1200 ctctgctgct ctcaagcact agaagaagga accagtaatc gaggaaactt gtggacccca 1260 atggtgtctg tcccggccag gcctggctgg gcccacacag gacaacaggg ttcaggggtc 1320 tggacagctg tttctgccca gggaattgtc cctgccacct cacactggcc actggaaagg 1380 aaagagagga ggaggcggca ggctaaccca cccgtgagcc agtcgagtct acattgtcag 1440 ttctcacctc gaggggtgcc aaaaaccaga gggaagcaaa ggcccctgaa gcctctgcca 1500 gaggccaacg ccccttcttg gttcaggaga ggtgcagtgt taggtgcagc acaaccaatg 1560 acttgcttat gtggctaata aattgtcaag agaaaaactg ggttagaatg caatatatag 1620 tatgtagtct catttttgta taaatacaag tatagaatgg cataactcaa aatccacaag 1680 tgatttggct ggattgtaaa tgacttttat tttcttcatt tctcatcata ttttctatta 1740 tacataaaga ttcattgtta atataaaagt acaaaattgc aacctatgaa ttaagaactt 1800 ctatatattg ccagttagaa gacagaatga aaaacattct cttcattcta accacacaca 1860 caaaaaactc cacaaaatac ctatggacta ccttcataga aggtggaaga gggtctgtat 1920 gaagaaaatg cttaatacat gaaagaagaa gctagtcaat gtggaggtct attgtgcgcc 1980 gggatcaaca aagacaagat atgtttaaaa tggtgttcta aatttaccct aatgtaaaac 2040 aaatccaata aaactctaat gtgatttttt aagaatttaa atttggaata attccaaaga 2100 acaatttttc ttaatttcta cagccagaat atataccttt aaaaaaaatg aaaacagaga 2160 ttaactttct cagaattggt tgactcactc tttcctttta tttttcttcc atggaatttt 2220 ccagttaact tgagaaagtg gaatcgaatt ccgatgttga attttccttc tggccccatt 2280 catgtggcag gtggtgattc aggtactact gggggctgct cagacaaacc tcctcatcag 2340 acatcaagag gctgttgcac caggagggcc ggtaccgtgt ctagaggtgg tcggcatggg 2400 gttggagttg tattacataa accctactcc aaacaaatgc atggggatgt ggctggagtt 2460 ccccgttgtc taaccagtgc caaagggcag gtcggtacct caccccacgt tcttaactat 2520 gggttggcaa catgttcctg gatgtgtttg ctggcacagt gacaggtgct agcaaccagg 2580 gtgttgacac agtccaactc catcctcacc aggtcactgg ctggaacccc tgggggccac 2640 cattgcggga atcagccttt gaaacgatgg ccaacagcag ctaataataa accagtaatt 2700 tgggatagac gagtagcaag agggcattgg ttggtgggtc accctccttc tcagaacaca 2760 ttataaaaac cttcctttcc acaggattgt cctcccgggc tggcagcagg gccccagcgg 2820 caccatgtct gccctcggag tcaccgtggc cctgctggtg tgggcggcct tcctcctgct 2880 ggtgtccatg tggaggcagg tgcacagcag ctggaatctg cccccaggcc ctttcccgct 2940 tcccatcatc gggaacctct tccagttgga attgaagaat attcccaagt ccttcacccg 3000 ggtaagagaa atagtgttga ttttagggag aataactcag caattggatc tggtatgtgt 3060 gtattcaact catttgcaga caaattgtgg ttgttcaata ccagcctgtt gtgaattacc 3120 tgaattgata gcatcctgga gcgacactca aaatgtgtcg cctgtggtgc agctggagcc 3180 cggagcctgc gtgccaggcc ccggaggccc ccgccgtgcc ttgtcctggg gctgatgatg 3240 gggaggccgg cgaggccggg ctgctgcgac gccaggataa ccgggctggc ggccagatgc 3300 gcactcgctg ggcgtccgcc tgtgtttgcc aaagcacgag ttgaaacgtg aagtgttggg 3360 ccagcccgtg tggcaccaat acctgccgcc tacgactgtt gtgaacactg aatgggccaa 3420 caaacctaaa cgttaaatga actgataacg ccgtcagcac ggagcaggcg ctgggtgttt 3480 gcgctcttgc gcgtgcgctg ctgtggggcg caggctgacg gcgggcgggg gtcgcctgct 3540 ccagctcggg ctcccgcgcc agaaccgggt ccagaacctt gattccggaa gcgggcaacg 3600 gggtggttgg tgggcgcgcc tgagggaagg gacgtgagga gccggagtcc gcggagttgc 3660 cgcggagttg tccgcggagt ccaggcgggt ggggagcaga gcagctggaa ccccccgagc 3720 gccctgcaga cgcagcagcc tcttgagggg agggtctccc ccacctcggg ctggacaaag 3780 acagcttttc cccacgtccc tctgggttct ctagagcaac agcaataccc gcccggcagg 3840 tgtggcttag agccccgcac ctcctcgccg cgcgcgggcc tgacttctag ccacgggtct 3900 ccgcagttgg cccagcgctt cgggccggtg ttcacgctgt acgtgggctc gcagcgcatg 3960 gtggtgatgc acggctacaa ggcggtgaag gaagcgctgc tggactacaa ggacgagttc 4020 tcgggcagag gcgacctccc cgcgttccat gcgcacaggg acaggggtga gtccgcgtcc 4080 ctggcacgga gcggggggtg cataacacgc cccgggacag ttacgggcgc tagccacgtc 4140 ggcgatggcc aaataataaa ctaacagtaa tattatagta atagcatccg aaggatgaga 4200 tcaggattag gcgatggccc ccgcgcgttg cctgccgagc gaggcgcact gagtcgccca 4260 ggaatccggc ctctcggcga ctgtgcggga gagttttatg gggatgggcg gggctgcttc 4320 tgagcaggag tcgccgcccc cacccccacc gttccgcctc tgggccgcag gctcctcccg 4380 ggagcgcttt cccctcctgt tcaaccgccg gggtacaggt ggcttcgtcc accgaggtcc 4440 cctcacccac gctgaggcgt cggaagctgc ggacactgct cgcttcaggg ctttgctcag 4500 ctgcagctgg tgacctccag agagggagtc tctgatgtcc cgctggggtg gatgtcctga 4560 gaccgggaag ggggaagaga cccactgaaa tcctatctcc cagcctcacc tctgctgtct 4620 cctccacgct tcctgtctcc agagccccga gttcagcata agcagaaagc ggcctgttcc 4680 ctctctaggg agaggagggt tgcggtctgg aggtctggct cgtctttatc tgcgcattct 4740 cccagcctcc tggcttcaga cctcagcgag gcggcggctg cggccggctc tcctcttcct 4800 gcctgcagac ctggcctgct gcttctttct ccttcctccc tccctgcctg ccctgcggtt 4860 tcaaagtaga ttagaaataa cagtgtccca catggaagcc tctacttctt cctgggtcaa 4920 ctttgatgac gaggctccag aaaacctttg caatgctgtg tggaattttt aaatcggtga 4980 gctcgtgctc ttgccctatt tatttgtcca gcgtacattt ctgaacattg tgaacgtcga 5040 atgggccaac aaatctaaaa attaaatgag ctgataaaga acgccgtcag cacagagcag 5100 acgctgggtg ttcgcgctct tgagcgtgcg ctctgcgggg cgcgggctgg tggcgggcgg 5160 gggtcgccgg ctccagctca ggttcccgcg ccaggaccgc gtccagaacc ttgtctccgg 5220 aagcgggcaa cggggtggtt gtatcacaat tagtggcatt tggttttcct tcttctgcat 5280 tgtgggtttt acttctctgg ggttgccaaa aacaaaatta accatctcag tccttgtcgt 5340 taacgcagga gaagcattac tggaggaggc tctggggttc tgtggttgag gagctcagtt 5400 ctggttccgg ggagccctta tctgccaccc acgggtccaa ggcacagtcg gaggcagcag 5460 ggaggggagc ggaattcaca tcaacacaga tggggctcaa ggggactttg ctgcctctgc 5520 ctggagggtc taaagtttca ttttcatatg acccgcaggg cgcagactgg cggaaaatta 5580 gcagagccct gggcatgggc tgcacctggc cttaagggac aatgatggaa atattcctta 5640 ttagcacaat actgagcaca ggctgtgtga taatgtgtca agggaactgc agacatcctt 5700 tcagaaaaag ttcataaaac ggagaaagtt tggttcccaa cctagatttt taacctgttg 5760 aactctgtct aaatgggtca tctcgggatg tcctccactc aacatgacca cagtctgccc 5820 ctctgtccca cctgtctcct cagtccttcc tccccacctt tcaggatgaa atgaaaccct 5880 cagtccagct gcacccctgc cccacccacc tcatctcatg tgccctcccg cccctctcag 5940 gccggacagc cttgcttctg gaacacacga gcacagcttc accaggcact ttctgagcac 6000 cctgcaggcg cctcccagga gtggtcagtg gtcaatcagc taatgaagct gcataggaca 6060 tgacccttgt ttaccgcaga atgcccagag ctggcaggat gtcttatatg caggaagtac 6120 ccaaaatgta tttattgagg aagtgatgat ggataagagg aagacggaga gcgagggaga 6180 gaggggctag gggccctgcg gtgtaaaggg ggtgtggctg ggagtgtgca ggggaacagg 6240 gatcatttca aggttcctat ctgggagaaa ataaaaaggt ttacagttag ttgagataag 6300 cgtgggaata tgcgaacatt tttaaagaat aaaaagttta gctttaaatt tgttgattcc 6360 aaatgtgttc atactctcgg gaggatccat caagcaactc ttgggaggag agacagggca 6420 gggcaggcct tgacagctca gaagggcgca gtagggacag ttcttggttt tcccagctct 6480 gatgctttgc acagtcgctt gtgtgacctg caagatttta gtgaagaaac ttgctgtgga 6540 gtcggaaagc tgcaagttga ggtgtgtgtg gtgtgagggt taaaaatctg tgagaacaga 6600 atgaatggct tttcaagaat gttgtcgata gataggaaag aggtgggagg tgttcttgga 6660 gtggccatat gtggttttat gtagcatggg gaagactcag cagaaaggaa aaagaaagaa 6720 ggtaaattga cagcatgaag tagagcaccc aggagaggct acatgtgatg aagaaaccac 6780 agtgcagact gtgaggaccc cagaaaggct cctccccaaa acctgaccag tggccggtgc 6840 tggcagctcc caggctggga caccctctgt ctctctgtcc ctctgccccc tctgtcactt 6900 ctttatacac ctgtaaatcc tgccctgctc tccaaggccc tctgtagccc atttctcccc 6960 aaaatgggta tttagaataa ccttctgctg gcccctctgc cttaggaatc atttttaata 7020 atggacctac ctggaaggac atccggcggt tttccctgac caccctccgg aactatggga 7080 tggggaaaca gggcaatgag agccggatcc agagggaggc ccacttcctg ctggaagcac 7140 tcaggaagac ccaaggtgcg tatctgctgc ctagcagggc ccagtcctct tgcagaccag 7200 cggtgtgggg agccctggct gggactccta gactgcatct gaaccacagg gacctacgga 7260 caaggagagg gtctcgtgag tccccagata ctgcatttta caactctagg ttccagctac 7320 acagttcagg gagcaagggt ggccattaaa cacgtgactt gtatcctaaa tactgttgaa 7380 aagcaaagga aactcaaaca ggttcagaca ttcactatct ttcgtaaact ggcagttttc 7440 agggcacctt ctcacaggcc ttggtgaacc tcagtgggtg actgagcagg tggaggagtc 7500 tcctcacccc catcttctgg ttgccctgac tgcctgtttt gtaggccagc ctttcgaccc 7560 caccttcctc atcggctgcg cgccctgcaa cgtcatagcc gacatcctct tccgcaagca 7620 ttttgactac aatgatgaga agtttctaag gctgatgtat ttgtttaatg agaacttcca 7680 cctactcagc actccctggc tccaggtgaa gccactttcc tctttcatca gtcatcaact 7740 gtagagttta cgttagaaaa agaaggaaaa tttgggttat atgtgataga caggactgca 7800 aaagccaaac aacatagctt cgaggggtgt ttgattagac agcccaaata ttcctcccag 7860 agacatctct ggggccccac gcaccccctt tcctaacgtc aggatgtgta tcgacctgtg 7920 tgtgcacatt tgccatgcag agtttgcact gctgaggaga atggtgccca agaaggacac 7980 tgttgaccca aaatattcca aataaacaat gattacagcc acaaattcag gtttggagaa 8040 agttgttggt ccaacacaca caattatgtt gcatccagaa aaaagtagta aaatattttt 8100 ttccctctct agctttacaa taattttccc agctttctac actacttgcc tggaagccac 8160 agaaaagtca taaaaaatgt ggctgaagta aaagagtatg tgtctgaaag ggtgaaggag 8220 caccatcaat ctctggaccc caactgtccc cgggacctca ccgactgcct gctcgtggaa 8280 atggagaagg taggctcggc ctcccatgat gtgggctctc cggggtgggc agagaatgca 8340 caatttcaga tttacagagt gagctgcact tgctggtgtc cagacctccc accgcagcat 8400 gctctgagtt tcatacacac actcttggct tcagcatgac cactggacgc aagtcagcct 8460 gcctggctgc caagctggcc tggggtttgg ggcacatggg cgggacgctt agctctctcc 8520 aggccctgct gctcaaccct ttctagtctg cagactttga gaattgcatt ttgtctgagg 8580 agaagccctc agccttcctt gtgggcatgc actccccaac tgtgcgcacg tgcaggactt 8640 ccaggcctcc ccagcttcat ccacctgcag gtgctcagga tcctgatccc ctgccccctt 8700 cccaccttgg tgaaacttct tgtatccttg tcttgtcctt tcctatggct tgtggctcaa 8760 gaacaaatgt ggagcccaca ctgatttccc aggactgtct gagcatcttc tccaccagtt 8820 tggcccctcg tggcagcaga cactagccct gtagcaggag gggttagcag gagccgttta 8880 gctcctgcct gagctatgac caaggtcagg gggatctcac ctctcccagg atggccctca 8940 tgctgtggag ggagacagag ccctggcctg ccctcagcag atttctggga gcctcagttt 9000 ccctggctgt gagtggagat gactctgtct gtcacagctc caagtcacag ttccactggg 9060 agagcctctt ggacactgtc tcctgtgtcc ctgtggagct gggaggtggc tggttctgtg 9120 ctgaaaggag acaagcagcc ccttctctcc ggtctgtctc cggtatcaca ggaaaagcac 9180 agtgcagagc gcttgtacac aatggacggt atcaccgtga ctgtggccga cctgttcttt 9240 gcggggacag agaccaccag cacaactctg agatatgggc tcctgattct catgaaatac 9300 cctgagatcg aaggtaggca agtgactgaa gggacaccgt gcgtgcggct gcatctccct 9360 ggatggccag ccttgcacat tttaggctgc agctttctgt ctgaagctgc ttgttaaccc 9420 tcatggtgat gtggtgagat ggctggatgc actgctgtga ggggaggtgt tatggtctgt 9480 gctgaacact ggtactcttg cacactggtt ggtccatacc ccactaagac acccctggtt 9540 gcagaaaaga acatcccaac accagagtgg agagaggtgg cagggtctgc attctgctcc 9600 ataaataacc tctttatgac agagaagata atgtcccagt tccccccaag taagacctgg 9660 tcttctaggc agagcaggtg gggaggttgg agctggaggg gagggtcctt gctggggcgt 9720 cttcctcaaa tgcggacgtg aggagggaag tccaggaaga agcagctaca gctccccctg 9780 gacccttgtc gttccttcca cagggctcct cccagcggca cctggggcag ctgggactct 9840 gtgcctggag gaggtgtgaa aggtctgggt ctaggtgggc agagggtcat gccctgagaa 9900 acacccatct gggccaagta gaggtgatgt gagggcaccg catgcaaaca ggccagtcag 9960 ggttgggtcc aagtaaaggg gaggaaaggg agctgcagcc tggctggaga gtgccggggg 10020 gcccagagcc cctgcctctc gctgggctgg aaacagggct gggcagcctc tgcccgaggc 10080 agttcacagc ctgagtggtg tgtgccgccc tcctcctgaa gctgctgcta atggtcactt 10140 gtggtcttaa ggctcgtcag ttcctgaaag caggtattat aggctatgaa gttatttccc 10200 ccaagaaagt cgacatgtga tggatccagg gtcagaccct ggcttttctt gttctttcct 10260 tcttcttctt ctttttattt atttattttt tttttgaggg gacagggtct cactctgttg 10320 cccaggctgg agtgcggtga tgcaatcatg gctcattgta gcttctacct attgggctca 10380 agcgatcctc ccacctcagc ctcccaagta actgggccac aggtgcacac caccacaccc 10440 agctgattaa aaatttaaaa aaattatttt ggctgggcac agtggctcat acctgtaatc 10500 ctggcacttt gggaggctga ggcaggcgat cacgaggtca ggagttcgag accttcctgg 10560 ccaacatgat gaaaccctgt ctctcctaaa atacaaaaaa gtagccgggt gtggtggcac 10620 gcgcctatag tcacagctac tcaggaggct gaggcaggag aatcgcttca acctcagagg 10680 cacagggtgc agtgatccga gattgcaccc cactgcactc tagcctgaca acagagcaag 10740 aatcagtcta aaaaaaaaat tgtagagaca agttgttact atgttttgta ggctggtctt 10800 gaactcctgg gctcaagtca tcctcctgcc ttggcctccc aaagtgctgg ggttacaggt 10860 gtggccaccg tgccccatcc ctggcctttg ctttttcaat cacatggaaa tgtgaagggt 10920 gaaggagcca aaagtttagg gaaggaatca ttgtatggat ctgcagtgat tataagagaa 10980 ctttcgacta ctctgcacta ggggaaccat ggaatcaaaa aatgttttaa attattattt 11040 atgaggaggt tccaatatag acaaaaggaa aataaatatg attgacatgt atatatccat 11100 tgccaaattg aacgtttatt aacattttgc gatacttcca tcagagctct taaaaagaaa 11160 atgtgttaca gagccagcca aagtctacct cctcacatct ccccacctct ctcaccagaa 11220 atggcttcag aattgctgtg tggctttgca cttttaacag ttgttaatta tcagcacagt 11280 attcatatta ttgctgtatg tgtttaatat tttacctggg tactgtacat aacattttgc 11340 agcttggttt tttcactcaa catatgatga tgttccatgg gaactccaaa cacggggagg 11400 ctaggcgact tgctcaaggc agctgttacc tctgtcagaa agacagaggc tttcagattc 11460 aagaagtaga ccctgcatgt ctgattctgt tctgtaaacc cccttcatac tcagaagcat 11520 gcaataaaca agcctggggt aattatcaat gcaaaggtta ccctcccaga agaaatttcc 11580 aaaacacttt cattattctc tgctcttgac atgaagagaa ctgaataagc catcatcaac 11640 tgagataatg gatgccaaaa catccagtaa ataacctcat agagcttagc tctcactaag 11700 tttttggagc attttccagt aattcaaagg acctggggaa ccttaagcac tgcttaggat 11760 gctccataaa catcttctgc gtgggtaggg gagtggatgg atggctggat gggtgggtgg 11820 atggacggac ggatggatgg atggatggat ggatggatgg ttggatggat gggtgggtgg 11880 atggatggat gggtcaatgg atgtgtggat ggatggaagg gtgggtggat gggtggatgg 11940 ctggctggtt gggtgggtgg gtggatggat gcatgggtgg atggatggag gatggatgga 12000 tggatggagg ggtgtataga tggaggggtg gatggatgtg taggtgggca gatggataaa 12060 agcgtgattg aatagatggg tggatgatgg gtggatgccc aactggccag gaaccaatcc 12120 ctgaaatttg tcccattcat atcttggcag agaagctcca tgaagaaatt gacagggtga 12180 ttgggccaag ccgaatccct gccatcaagg ataggcaaga gatgccctac atggatgctg 12240 tggtgcatga gattcagcgg ttcatcaccc tcgtgccctc caacctgccc catgaagcaa 12300 cccgagacac cattttcaga ggatacctca tccccaaggt taagcaatga gcctgcagca 12360 cacagcatga acaccatcct atcactaatc gccttcctgc cagggagcag gatgggggcc 12420 ccaagaccct tccctttggc aggggtcact gaggggaagg gctggcccca ctcccaccct 12480 gtgggatact gcatctccag gagtgctcac attggcctgg tgaccagaga ggtggaggaa 12540 atctggaaaa gagcctcagc agatagtgcc tgggactgta gtgaattcta atgccaggaa 12600 caaactatca caaccagccc tggggttaat cctgtgagaa gattagggct ttcatcttca 12660 tttagacctg acccctgact gctttctatc taatccttca ctaagcaact ccttcaactc 12720 gaaatatact atcctatata gcataatatt caaaacaaca ttcttcactg ggggtttcca 12780 gatgaaagcc cacattttgt taacatgact cactgagaca gtctttgttt ctcctagggc 12840 acagtcgtag tgccaactct ggactctgtt ttgtatgaca accaagaatt tcctgatcca 12900 gaaaagttta agccagaaca cttcctgaat gaaaatggaa agttcaagta cagtgactat 12960 ttcaagccat tttccacagg tgagaaagat cagaggcagt accttccctt gaggagcagc 13020 ccacactcct catctcccct ccacatgtgc tctgccctcg tcccaggcac ccactgacac 13080 cccaaacctc actgtgtgcc ctgtttctat tgacaacatg acccaaatgt gctcttccct 13140 gttcagagaa gttacataac atcttttagc agcaatcctg ggaatgaagt gttgtaggtg 13200 gatttttttt ttcccaaaga ctagacattt tacatcattc attgctaaat tttgtttcta 13260 ttttaacaag acttagtgaa aagctctcaa agccatatta cccaattctc cctaatttta 13320 aaccagagct actaaacaaa acctaacctt tggttaccta gaatcatcac aggaagcatc 13380 aaagccttcc tgggatgtga ctcagtgatt ttctttgagg cacttgtcct ccttcccagg 13440 gcctcatctt agggattgtt gtgggaagat catacaacca actccatact tttcacaccc 13500 agtgctggag ccccagcttc taacagggca ctatttccct cctgtaggca tcactgatga 13560 gcactggggg tgccttcttt actgggcaga catggtcttc ccaacttaac accggttttt 13620 gcagttgagc tctggataat tgagattgta tgaaggctgg tccccgaatt agtcagtgtc 13680 gctggtatcc ttccactcaa gtacattttg tgcttctttt aataggcaga gaggggtgag 13740 tcctgccctg tgatggccgt ttgcccacag cctcctcctc cccgcttccc ctagtctcac 13800 tgttaacagt gtcgtgtctc tgaaactccc tcagtgtctc atcaatacca ttgttacttc 13860 taggaaaacg agtgtgtgct ggagaaggcc tggctcgcat ggagttgttt cttttgttgt 13920 gtgccatttt gcagcatttt aatttgaagc ctctcgttga cccaaaggat atcgacctca 13980 gccctataca tattgggttt ggctgtatcc caccacgtta caaactctgt gtcattcccc 14040 gctcatgagt gtgtggagga caccctgaac cccccgcttt caaacaagat ttcgaattgt 14100 ttgaggtcag gatttctcaa actgattcct ttctttgcat atgagtattt gaaaataaat 14160 attttcccag aatataaata aatcatcaca tgattatttt aactatatgt taagtcatgg 14220 aatatcttaa ttgtttaagt gattctcaca gagaggtttt tttttttttt tttttttttt 14280 tgagagtttt gctcttgttg accaggatgg agtgcagtgg catgatcttg gctcactgca 14340 acctctgtgt cctgggttca agtgattctc ctccctcagc ctcccgaata gctgggatta 14400 caggcaccca ccaccatgcc agctaattct ttgtattttt agcagagaca gggtttcacc 14460 atgttggtca ggctggtctt gaacccctga cctcaggtga tccacctacc tcggcctccc 14520 aaagtgctgg gattacagca tgagccaccg cgcccagcca gagagaggtt ttaaatatat 14580 atgtttactt taatattaag ttataacata attttcatgt tattgaaaag ctcttccatc 14640 taggatcaca ccacttcagt gtcagaatca tattgaggtg gggaatttgt attagtcagg 14700 tttctctaaa gggacagaaa caataggata gatgtatata cgaaagggag tttattagga 14760 gaattgactc acatga 14776 7 882 DNA human 7 cggccaggct tgcgcgtggt tcccctcccg gtgggcggat tcctgggcaa gatgaagtgg 60 gtgtgggcgc tcttgctgtt ggcggcgtgg gcagcggccg agcgcgactg ccgagtgagc 120 agcttccgag tcaaggagaa cttcgacaag gctcgcttct ctgggacctg gtacgccatg 180 gccaagaagg accccgaggg cctctttctg caggacaaca tcgtcgcgga gttctcggtg 240 gacgagaccg gccagatgag cgccacagcc aagggccgag tccgtctttt gaataactgg 300 gacgtgtgcg cagacatggt gggcaccttc acagacaccg aggaccctgc caagttcaag 360 atgaagtact ggggcgtagc ctcctttctg cagaaaggaa atgatgacca ctggatcgtc 420 gacacagact acgacacgta tgccgtacag tactcctgcc gcctcctgaa cctcgatggc 480 acctgtgctg acagctactc cttcgtgttt tcccgggacc ccaacggcct gcccccagaa 540 gcgcagaaga ttgtaaggca gcggcaggag gagctgtgcc tggccaggca gtacaggctg 600 atcgtccaca acggttactg cgatggcaga tcagaaagaa accttttgta gcaatatcaa 660 gaatctagtt tcatctgaga acttctgatt agctctcagt cttcagctct atttatctta 720 ggagtttaat ttgcccttct ctccccatct tccctcagtt cccataaaac cttcattaca 780 cataaagata cacgtggggg tcagtgaatc tgcttgcctt tcctgaaagt ttctggggct 840 taagattcca gactctgatt cattaaacta tagtcacccg tg 882 8 2452 DNA human 8 gtggacttgt tgcagttgct gtaggattct aaatccaggt gattgtttca aactgagcat 60 caacaacaaa aacatttgta tgatatctat atttcaatca tggaccaaaa tcaacatttg 120 aataaaacag cagaggcaca accttcagag aataagaaaa caagatactg caatggattg 180 aagatgttct tggcagctct gtcactcagc tttattgcta agacactagg tgcaattatt 240 atgaaaagtt ccatcattca tatagaacgg agatttgaga tatcctcttc tcttgttggt 300 tttattgacg gaagctttga aattggaaat ttgcttgtga ttgtatttgt gagttacttt 360 ggatccaaac tacatagacc aaagttaatt ggaatcggtt gtttcattat gggaattgga 420 ggtgttttga ctgctttgcc acatttcttc atgggatatt acaggtattc taaagaaact 480 aatatcaatt catcagaaaa ttcaacatcg accttatcca cttgtttaat taatcaaatt 540 ttatcactca atagagcatc acctgagata gtgggaaaag gttgtttaaa ggaatctggg 600 tcatacatgt ggatatatgt gttcatgggt aatatgcttc gtggaatagg ggagactccc 660 atagtaccac tggggctttc ttacattgat gatttcgcta aagaaggaca ttcttctttg 720 tatttaggta tattgaatgc aatagcaatg attggtccaa tcattggctt taccctggga 780 tctctgtttt ctaaaatgta cgtggatatt ggatatgtag atctaagcac tatcaggata 840 actcctactg attctcgatg ggttggagct tggtggctta atttccttgt gtctggacta 900 ttctccatta tttcttccat accattcttt ttcttgcccc aaactccaaa taaaccacaa 960 aaagaaagaa aagcttcact gtctttgcat gtgctggaaa caaatgatga aaaggatcaa 1020 acagctaatt tgaccaatca aggaaaaaat attaccaaaa atgtgactgg ttttttccag 1080 tcttttaaaa gcatccttac taatcccctg tatgttatgt ttgtgctttt gacgttgtta 1140 caagtaagca gctatattgg tgcttttact tatgtcttca aatacgtaga gcaacagtat 1200 ggtcagcctt catctaaggc taacatctta ttgggagtca taaccatacc tatttttgca 1260 agtggaatgt ttttaggagg atatatcatt aaaaaattca aactgaacac cgttggaatt 1320 gccaaattct catgttttac tgctgtgatg tcattgtcct tttacctatt atattttttc 1380 atactctgtg aaaacaaatc agttgccgga ctaaccatga cctatgatgg aaataatcca 1440 gtgacatctc atagagatgt accactttct tattgcaact cagactgcaa ttgtgatgaa 1500 agtcaatggg aaccagtctg tggaaacaat ggaataactt acatctcacc ctgtctagca 1560 ggttgcaaat cttcaagtgg caataaaaag cctatagtgt tttacaactg cagttgtttg 1620 gaagtaactg gtctccagaa cagaaattac tcagcccatt tgggtgaatg cccaagagat 1680 gatgcttgta caaggaaatt ttactttttt gttgcaatac aagtcttgaa tttatttttc 1740 tctgcacttg gaggcacctc acatgtcatg ctgattgtta aaattgttca acctgaattg 1800 aaatcacttg cactgggttt ccactcaatg gttatacgag cactaggagg aattctagct 1860 ccaatatatt ttggggctct gattgataca acgtgtataa agtggtccac caacaactgt 1920 ggcacacgtg ggtcatgtag gacatataat tccacatcat tttcaagggt ctacttgggc 1980 ttgtcttcaa tgttaagagt ctcatcactt gttttatata ttatattaat ttatgccatg 2040 aagaaaaaat atcaagagaa agatatcaat gcatcagaaa atggaagtgt catggatgaa 2100 gcaaacttag aatccttaaa taaaaataaa cattttgtcc cttctgctgg ggcagatagt 2160 gaaacacatt gttaagggga gaaaaaaagc cacttctgct tctgtgtttc caaacagcat 2220 tgcattgatt cagtaagatg ttatttttga ggagttcctg gtcctttcac taagaatttc 2280 cacatctttt atggtggaag tataaataag cctatgaact tataataaaa caaactgtag 2340 gtagaaaaaa tgagagtact cattgtacat tatagctaca tatttgtggt taaggttaga 2400 ctatatgatc catacaaatt aaagtgagag acatggttac tgtgtaataa aa 2452 

1. A method for detecting hepatocellular carcinoma comprising the step of: (a) measuring, in a tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene; and (b) comparing the expression level(s) of the gene(s) measured in (a) with the expression levels of the genes in a control that correspond to the genes measured on step (a).
 2. A method for detecting hepatocellular carcinoma comprising the step of: (a) measuring, in a tested tissue, the expression level(s) of at least one gene selected from the group consisting of plasminogen gene, EST1549, retinol-binding protein 4 gene and organic anion transporter C gene, and at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene; and (b) comparing the expression levels of genes measured in (a) with the expression levels of genes in a control that correspond to the genes measured in (a).
 3. A method for detecting hepatocellular carcinoma according to claim 1, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by determining the amount of transcripts of the genes being measured.
 4. A method for detecting hepatocellular carcinoma according to claim 1, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by amplifying whole or a part of the DNA to be measured and using cDNA prepared from gene transcripts as a template.
 5. A method for detecting hepatocellular carcinoma according to claim 1, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by invader assay.
 6. A method for detecting hepatocellular carcinoma according to claim 1, wherein the step (a) of measuring the expression level(s) of the gene(s) is performed by hybridizing labeled cDNA prepared from transcripts including the gene(s) to be measured with whole or a part of the immobilized DNA of the gene(s) to be measured.
 7. A method for detecting hepatocellular carcinoma according to claim 1, wherein the tested tissue in the step (a) is liver tissue of a chronic hepatitis patient.
 8. A method for detecting hepatocellular carcinoma at an early stage that comprises the step of periodically measuring the expression level(s), in a tested tissue, of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene.
 9. A method for detecting hepatocellular carcinoma at an early stage that comprises the step of periodically measuring the expression level(s), in a tested tissue, of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene.
 10. A DNA chip for detecting hepatocellular carcinoma in which whole or a part of DNA comprising transcribed region(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene is immobilized.
 11. A DNA chip for detecting hepatocellular carcinoma in which whole or a part of DNA, in a tested tissue, comprising transcribed region(s) of at least one gene selected from the group consisting of plasminogen gene, EST51549, retinol-binding protein 4 gene and organic anion transporter C gene, and, at least one gene selected from the group consisting of aldolase B gene, carbamyl phosphate synthase 1 gene, albumin gene and cytochrome P450 subfamily 2E1 gene. 